智能论文笔记

Exponential concentration and untrainability in quantum kernel methods

Supanut Thanasilp , Samson Wang , M. Cerezo , Zoë Holmes

分类：机器学习 | (统计)机器学习

2022-08-23

量子机学习（QML）中的内核方法最近引起了人们的重大关注，作为在数据分析中获得量子优势的潜在候选者。在其他有吸引力的属性中，当训练基于内核的模型时，可以保证由于训练格局的凸度而找到最佳模型的参数。但是，这是基于以下假设：量子内核可以从量子硬件有效获得。在这项工作中，我们从准确估计内核值所需的资源的角度研究了量子内核的训练性。我们表明，在某些条件下，可以将量子内核在不同输入数据上的值呈指数浓缩（在量子数中）指向一些固定值，从而导致成功训练所需的测量数量的指数缩放。我们确定了可以导致集中度的四个来源，包括：数据嵌入，全球测量，纠缠和噪声的表达性。对于每个来源，分析得出量子内核的相关浓度结合。最后，我们表明，在处理经典数据时，训练用内核比对方法嵌入的参数化数据也容易受到指数浓度的影响。我们的结果通过数值仿真来验证几个QML任务。总体而言，我们提供指南，表明应避免某些功能，以确保量子内核方法的有效评估和训练性。

translated by 谷歌翻译

Dynamical simulation via quantum machine learning with provable generalization

Joe Gibbs , Zoë Holmes , Matthias C. Caro , Nicholas Ezzell , Hsin-Yuan Huang , Lukasz Cincio , Andrew T. Sornborger , Patrick J. Coles

分类：机器学习

2022-04-21

作为量子优势的应用，对动态模拟和量子机学习（QML）的关注很大，而使用QML来增强动态模拟的可能性尚未得到彻底研究。在这里，我们开发了一个框架，用于使用QML方法模拟近期量子硬件上的量子动力学。我们使用概括范围，即机器学习模型在看不见的数据上遇到的错误，以严格分析此框架内算法的训练数据要求。这提供了一种保证，就量子和数据要求而言，我们的算法是资源有效的。我们的数字具有问题大小的有效缩放，我们模拟了IBMQ-Bogota上的Trotterization的20倍。

translated by 谷歌翻译

Equivalence of quantum barren plateaus to cost concentration and narrow gorges

Andrew Arrasmith , Zoë Holmes , M. Cerezo , Patrick J. Coles

分类：机器学习

2021-04-12

优化参数化量子电路（PQC）是使用近期量子计算机的领先方法。但是，对于PQC的成本函数景观知之甚少，这阻碍了量子意识到的优化器的进展。在这项工作中，我们研究了PQCS已观察到的三种不同景观特征之间的联系：（1）指数呈指数消失的梯度（称为贫瘠的高原），（2）关于平均值的成本成本集中，以及（3）（3）指数的狭窄小小的（称为狭窄的峡谷）。我们在分析上证明，这三个现象一起出现，即当发生一个现象时，其他两个现象也是如此。该结果的一个关键含义是，可以通过成本差而不是通过计算更昂贵的梯度来数字诊断贫瘠的高原。更广泛地说，我们的工作表明，量子力学排除了某些成本景观（否则在数学上可能是可能的），因此从量子基础的角度来看，我们的结果很有趣。

translated by 谷歌翻译

Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

Harry Coppock , George Nicholson , Ivan Kiskin , Vasiliki Koutra , Kieran Baker , Jobie Budd , Richard Payne , Emma Karoune , David Hurley , Alexander Titcomb

分类：机器学习

2022-12-15

Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK governments pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata, including reverse transcription polymerase chain reaction (PCR) test outcomes, of whom 23,514 tested positive for SARS CoV 2. Subjects were recruited via the UK governments National Health Service Test-and-Trace programme and the REal-time Assessment of Community Transmission (REACT) randomised surveillance survey. In an unadjusted analysis of our dataset AI classifiers predict SARS-CoV-2 infection status with high accuracy (Receiver Operating Characteristic Area Under the Curve (ROCAUC) 0.846 [0.838, 0.854]) consistent with the findings of previous studies. However, after matching on measured confounders, such as age, gender, and self reported symptoms, our classifiers performance is much weaker (ROC-AUC 0.619 [0.594, 0.644]). Upon quantifying the utility of audio based classifiers in practical settings, we find them to be outperformed by simple predictive scores based on user reported symptoms.

translated by 谷歌翻译

Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19

Davide Pigoli , Kieran Baker , Jobie Budd , Lorraine Butler , Harry Coppock , Sabrina Egglestone , Steven G. Gilmour , Chris Holmes , David Hurley , Radka Jersakova

分类：机器学习

2022-12-15

Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has been interest in using artificial intelligence methods to predict COVID-19 infection status based on vocal audio signals, for example cough recordings. However, existing studies have limitations in terms of data collection and of the assessment of the performances of the proposed predictive models. This paper rigorously assesses state-of-the-art machine learning techniques used to predict COVID-19 infection status based on vocal audio signals, using a dataset collected by the UK Health Security Agency. This dataset includes acoustic recordings and extensive study participant meta-data. We provide guidelines on testing the performance of methods to classify COVID-19 infection status based on acoustic features and we discuss how these can be extended more generally to the development and assessment of predictive methods based on public health datasets.

translated by 谷歌翻译

A large-scale and PCR-referenced vocal audio dataset for COVID-19

Jobie Budd , Kieran Baker , Emma Karoune , Harry Coppock , Selina Patel , Ana Tendero Cañadas , Alexander Titcomb , Richard Payne , David Hurley , Sabrina Egglestone

分类：机器学习

2022-12-15

The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmission of the Alpha and Delta SARS-CoV-2 variants and some Omicron variant sublineages. Audio recordings of volitional coughs, exhalations, and speech were collected in the 'Speak up to help beat coronavirus' digital survey alongside demographic, self-reported symptom and respiratory condition data, and linked to SARS-CoV-2 test results. The UK COVID-19 Vocal Audio Dataset represents the largest collection of SARS-CoV-2 PCR-referenced audio recordings to date. PCR results were linked to 70,794 of 72,999 participants and 24,155 of 25,776 positive cases. Respiratory symptoms were reported by 45.62% of participants. This dataset has additional potential uses for bioacoustics research, with 11.30% participants reporting asthma, and 27.20% with linked influenza PCR test results.

translated by 谷歌翻译

Safe and Smooth: Certified Continuous-Time Range-Only Localization

Frederike Dümbgen , Connor Holmes , Timothy D. Barfoot

分类：机器人

2022-09-09

定位移动机器人的一种常见方法是测量已知位置点的距离，称为锚点。从距离测量值中定位设备通常是由于测量模型的非线性而作为非凸优化问题。当使用局部迭代求解器（如高斯 - 牛顿）时，非凸优化问题可能会产生次优的解决方案。在本文中，我们为连续范围的本地化设计了最佳证书。我们的公式可以整合运动，从而确保溶液的平滑度，并且对于仅从几个距离测量值进行定位至关重要。拟议的证书几乎没有额外的成本，因为它的复杂性与稀疏本地求解器本身的复杂性相同：位置数量的线性。我们在仿真和现实世界数据集中显示，有效的本地求解器通常会找到全球最佳解决方案（通过我们的证书确认），而当没有证书确认时，简单的随机重新初始化最终会导致可认证的最佳选择。

translated by 谷歌翻译

Developing moral AI to support antimicrobial decision making

William J Bolton , Cosmin Badea , Pantelis Georgiou , Alison Holmes , Timothy M Rawson

分类：人工智能 | 机器学习

2022-08-12

辅助抗菌处方的人工智能（AI）提出了重大的道德问题。利用与AI驱动的系统一起利用道德框架，同时考虑特定的复杂性，可以支持道德决策以应对抗菌抗性。

translated by 谷歌翻译

Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding

Connor Holmes , Minjia Zhang , Yuxiong He , Bo Wu

分类：自然语言处理 | 人工智能

2022-06-30

近年来，大型预训练的变压器网络已显示出许多自然语言理解任务的巨大改进。但是，由于延迟和成本限制，这些模型的巨大规模给他们的微调和在线部署带来了重大挑战。支持N：M半结构化的稀疏性和低精油整数计算的新硬件是提高DNN模型效率的有前途解决方案。但是，很少有研究系统地研究预先训练的变压器网络在多大程度上受益于这些技术的组合，以及如何最好地压缩变压器的每个组件。我们提出了一个灵活的压缩框架NXMiformer，该框架使用ADMM和基于Ste的QAT执行同时进行稀疏和量化。此外，我们介绍且廉价的启发式驱动搜索算法，该算法标识了满足压缩比约束的有希望的异质压缩配置。当通过NLU基准测试的胶水套件进行评估时，我们的方法可以达到BERT模型编码器的93％压缩，同时保留了98.2％的原始模型准确性并充分利用硬件功能。异质配置通过搜索启发式发现了基线准确性的99.5％，同时仍将模型压缩为87.5％。

translated by 谷歌翻译

An Efficient Global Optimality Certificate for Landmark-Based SLAM

Connor Holmes , Timothy D. Barfoot

分类：机器人

2022-06-26

现代状态估计通常被表达为优化问题，并使用有效的本地搜索方法解决。这些方法最能保证与本地最小值的融合，但是在某些情况下，全球最优性也可以得到认证。尽管此类全球最佳证书已经为3D姿势格言优化确定了，但是对于基于3D地标的SLAM问题，尚未确定细节，其中估计的状态包括机器人姿势和地图地标。在本文中，我们通过使用图理论方法来解决这一差距，将基于里程碑的SLAM的子问题投入到一种形式，该形式产生了足够的全球最优状态。存在计算这些子问题的最佳证书的有效方法，但首先需要构建大型数据矩阵。我们表明，该矩阵可以以复杂性构建，该复杂性在地标数量中保持线性，并且不超过一个局部求解器的最新计算复杂性。最后，我们证明了证书对基于模拟和现实世界标记的大满贯问题的功效。

translated by 谷歌翻译